Associating an object with a subject

ABSTRACT

A method comprises receiving one or more depth images from a depth camera, the depth images indicating a depth of a surface imaged by each pixel of the depth images. The method may further comprise identifying a human subject imaged by the depth images and recognizing within one or more of the depth images a beacon emitted from a control device. A position of the control device may be assessed in three dimensions, and the control device may be associated with the human subject based on a proximity of the control device to the human subject or other parameter of the control device with relation to the human subject.

BACKGROUND

Association of a control device with a specific user typically involves multiple levels of interaction between the control device, the user, and a computing system. The association may also remain until a user specifically requests to update the control device with a new association.

SUMMARY

Embodiments are disclosed herein for providing a method to intelligently associate an object with a subject within an operating environment. For example, a computing device may receive one or more depth images from a depth camera, the depth images indicating a depth of a surface imaged by each pixel of the depth images. The computing device may utilize the depth images to identify a human subject imaged by the depth images and recognize within one or more of the depth images a beacon emitted from a control device. The computing device may assess a position of the control device in three dimensions to determine a relative location of the control device and associate the control device with the human subject based on a proximity of the control device to the human subject and/or any other suitable parameter of the control device in relation to the subject.

This Summary is provided to introduce a selection of concepts in a simplified form that are further described below in the Detailed Description. This Summary is not intended to identify key features or essential features of the claimed subject matter, nor is it intended to be used to limit the scope of the claimed subject matter. Furthermore, the claimed subject matter is not limited to implementations that solve any or all disadvantages noted in any part of this disclosure.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 schematically shows a non-limiting example of an environment including a computing system and one or more subjects and objects in accordance with an embodiment of the present disclosure.

FIG. 2 shows a method of associating an object with a subject in accordance with an embodiment of the present disclosure.

FIG. 3 schematically shows a nonlimiting example of a skeletal tracking pipeline in accordance with an embodiment of the present disclosure.

FIG. 4 shows a method of binding a control device to a computing system in accordance with an embodiment of the present disclosure.

FIG. 5 schematically shows an example communication diagram for an environment including a computing system and two or more control devices in accordance with an embodiment of the present disclosure.

FIG. 6 is an example computing system in accordance with an embodiment of the present disclosure.

DETAILED DESCRIPTION

The present disclosure is directed to intelligently associating a control device, such as a game controller, with a game player or another subject. The control device may emit a beacon, and a computer may use the beacon to determine a three-dimensional position of the control device. The three-dimensional position of one or more subjects may be assessed, and the control device is automatically associated with one such subject based on the proximity of the control device to that subject and/or any other suitable parameter of the control device in relation to the subject.

FIG. 1 shows an example environment 100 including computing system 102 and control devices 104, 106, 108, and 109. Although environment 100 is illustrated as a room, environment 100 may be any suitable physical space, including indoor and/or outdoor environments. Computing system 102 may be configured to receive input from and/or communicate with control devices 104, 106, 108, and 109. Control devices in accordance with the present disclosure may include any suitable control device, including but not limited to game controllers, audio headphones and/or headsets, remote controls, musical instruments (e.g., guitar, drums, etc.), steering wheel controllers, flightsticks, weapons (e.g., sword, laser sword, pistol, rifle, saber, halberd, nunchaku, etc.), mobile computing devices, etc. In the illustrated embodiment, control devices 104, 106, and 108 take the form of game controllers and control device 109 takes the form of an audio headset. Computing system 102 may include an imaging device, such as depth camera 110. For example, depth camera 110 may include one or more sensors, lens elements, and/or light sources that are configured to image environment 100. Depth camera 110 may utilize any suitable depth imaging technology, including but not limited to a time-of-flight depth camera and/or a structured light depth camera. Depth camera 110 may include additional sensors, including but not limited to a visible light camera and/or one or more microphones. Depth camera 110 may generate and send depth images to computing system 102. The depth images may indicate a depth of a surface imaged by each pixel of the depth images in order to provide information regarding one or more features of environment 100.

As further illustrated in FIG. 1, environment 100 may include one or more human subjects, such as game player 112 and game player 114, and one or more non-human subjects, such as table 116. In some embodiments, one or more users, such as game player 112, may interact with computing system 102. For example, Computing system 102 may provide visual output to game player 112 in any suitable manner, such as through display 122 of display device 124. Computing system 102 may also provide audio output to game player 112 in any suitable manner, including through one or more audio devices, such as audio headphones 126 of control device 109 and/or speakers of display device 124.

Game player 112 may provide input to computing system 102 via one or more user input devices, such as control device 104, depth camera 110, microphone 118 of control device 109, and/or any other suitable user input device. User input devices, such as those described above, may communicate with computing system 102 in any suitable manner, including but not limited to wired and/or wireless configurations. For example, a control device may provide user input to computing system 102 in the form of commands that are sent over a radiofrequency (RF) communications channel. In some embodiments, each of a plurality of control devices and/or other objects may communicate over a particular RF channel and/or provide a unique identifier during communication in order to distinguish a particular control device from another control device.

Control devices and/or other objects may also be distinguished from one another via a round-robin and/or time-sliced scheduling of message transmission. For example, multiple control devices may be assigned a time slot during which it may blink or otherwise transmit a signal. In response, the computing system may distinguish each control device by determining a time and/or order of received signals and associating the source of the signal with the control device assigned to that time and/or order value. Accordingly, the computing system may detect and associate the beacon that it observes with an object that it communicates with via another communications medium, such as RF, wired cable, etc.

Objects, such as control devices, user input/output devices, and the like, may also be configured to interact with computing system 102 by emitting a beacon. For example, as illustrated in FIG. 1, control device 106 may emit beacon 128. Beacon 128 may comprise any suitable beacon, including but not limited to an audio and/or optical beacon. For example, beacon 128 may include an infrared light beacon, a visible light beacon, a sonar beacon, and/or any suitable electromagnetic signal. In some embodiments, the beacon may only be emitted in response to a request or other message from a computing system. Beacon 128 may be used to determine a source of user input and/or a location of an object. Determining a source of user input may allow computing system 102 to provide a dynamically adjustable user experience. For example, content of an application may be customized based on a particular user providing input. In some embodiments, content of an application may be tied to a control device. For example, an audio output of an application providing sound to an audio headphone may be adjusted based on a position of the audio headphone.

Identifying a source of user input may also enable cooperation between multiple user input and/or control devices. For example, an audio headset may be associated with a control device such that audio input provided to the audio headset may supplement input provided to the control device. Gestures that are detected by a control device, depth camera, and/or other suitable gesture detection device may also be associated with a particular control device such that gesture-based commands may supplement input provided to the control device. For example, computing system 102 may determine that a gesture is performed by a particular user and/or in a particular location and associate the gesture with a control device based on the determined user and/or location.

Accordingly, in multi-user environments, a first user, such as game player 112 may provide input via control device 104 and control device 109 while a second user, such as game player 114 provides input via control device 108. By associating each control device with a respective user, computing system 102 may provide a multiplayer gaming experience. For example, user input provided by control device 104 and/or control device 109 may be applied to a first player representation 130 and user input provided by control device 108 may be applied to a second player representation 132. Likewise, audio associated with a first player representation may be directed to headphones 126 of control device 109, while audio associated with a second player representation may be directed to speakers of display device 124 and/or another suitable audio device.

FIG. 2 shows an example method 200 of associating an object with a subject according to an embodiment of the present disclosure. Method 200 may be performed by a computing system, such as computing system 102 of FIG. 1. At 202, method 200 includes receiving one or more depth images from a depth camera. For example, computing system 102 may receive depth images from depth camera 110 of FIG. 1. In some embodiments, computing system 102 may include an infrared sensor to detect infrared signals. In such embodiments, the computing system may receive one or more infrared images from an infrared signal instead of or in addition to receiving one or more depth images at 202. In additional or alternative embodiments, the computing system may receive two-dimensional images from a visible light (e.g., RGB) camera, heat map, and/or other suitable image source alone or in combination with an infrared sensor.

At 204, method 200 includes identifying a subject imaged by the depth images. In some scenarios, the subject is a human subject, such as game player 112 of FIG. 1. In some scenarios, the subject may be an inanimate object, such as table 116 and/or control devices 104, 106, 108, and/or 109 of FIG. 1. For embodiments in which a computing system includes an infrared sensor, as discussed above, the computing system may identify a subject imaged by the infrared images. For example, the computing system may analyze an infrared reflectivity of each pixel of an infrared image in order to determine a position of an object, beacon, and/or subject. In some embodiments, the computing system may utilize an infrared sensor to track and determine a two-dimensional location and a depth camera to calculate a depth value for the two-dimensional location. The computing system may analyze virtually any type of electromagnetic information to determine a location of an object, beacon, and/or subject without departing from the scope of this disclosure.

In the case of a human subject, identification may be specific to a particular individual. In other words, the identity of a particular person may be identified. For example, computing system 102 of FIG. 1 may identify game player 112 as being “John,” where “John” may be described by a user profile stored locally and/or remotely on a storage machine of computing system 102 and/or a database. The user profile may include any suitable information to identify a game player, including but not limited to virtual skeleton data, facial recognition data, audio parameters, etc. The user profile may be locally saved and/or accessed via a network such as the Internet. In other cases, identification is not specific to a particular individual. For example, computing system 102 of FIG. 1 may identify and track game player 112 without differentiating game player 112 from any other game player.

In some embodiments, a computing system may differentiate a game player from other game players without directly identifying the game player. For example, the computing system 102 of FIG. 1 may perform facial recognition, described in more detail below, or any other identification technique, without tying the game player to a particular user profile. The identification may be utilized to associate the game player with a particular representation within an application and/or other context of the computing system, such that input provided by the game player, regardless of the input mechanism utilized, is provided to the particular representation. For example, a game player may be identified via facial recognition while providing input associated with a particular character in a game. Thereafter, even if the game player utilizes a different input device, such as a different controller, the input provided by the game player may be associated with the particular character.

As indicated at 206, identifying a human subject optionally may include modeling the human subject with a virtual skeleton. Skeletal modeling may optionally be used to find the particular identity of an observed subject. For example, one or more characteristics detected via skeletal model (e.g., bone lengths, bone posture, etc.) may be used to look up a user identity in a look up table or other database. Such modeling is described in more detail with reference to FIG. 3.

FIG. 3 graphically shows a simplified skeletal tracking pipeline 300 of a depth analysis system. For simplicity of explanation, skeletal tracking pipeline 300 is described with reference to computing system 102 of FIG. 1. However, skeletal tracking pipeline 300 may be implemented on any suitable computing system without departing from the scope of this disclosure. Furthermore, skeletal tracking pipelines that differ from skeletal tracking pipeline 300 may be used without departing from the scope of this disclosure.

At 302, FIG. 3 shows game player 112 of FIG. 1 from the perspective of a depth camera, such as depth camera 110 of FIG. 1. A depth camera, such as depth camera 110, may include one or more sensors that are configured to observe a human subject, such as game player 112. The depth camera may also observe a beacon 304 that is emitted from a control device, such as control device 104. For example, beacon 304 may include an optical beacon, such as a visible light beacon and/or an infrared light beacon.

At 306, FIG. 3 shows a schematic representation 308 of the observation data collected by a depth camera, such as depth camera 110. The types of observation data collected will vary depending on the number and types of sensors included in the depth camera.

A depth camera may determine, for each pixel of the depth camera, the depth of a surface in the observed scene relative to the depth camera. FIG. 3 schematically shows the three-dimensional x/y/z coordinates 310 observed for a DPixel[v,h] of a depth camera 110. Similar three-dimensional x/y/z coordinates may be recorded for every pixel of the depth camera. The three-dimensional x/y/z coordinates for all of the pixels collectively constitute a depth map. The three-dimensional x/y/z coordinates may be determined in any suitable manner without departing from the scope of this disclosure.

A visible light camera may determine, for each pixel of the visible light camera, the relative light intensity of a surface in the observed scene for one or more light channels (e.g., red, green, blue, grayscale, etc.). FIG. 3 schematically shows the red/green/blue color values 312 observed for a V-LPixel[v,h] of a visible light camera of depth camera 110. Similar red/green/blue color values may be recorded for every pixel of the visible light camera. The red/green/blue color values for all of the pixels collectively constitute a digital color image. The red/green/blue color values may be determined in any suitable manner without departing from the scope of this disclosure.

The depth camera and visible light camera may have the same resolutions, although this is not required. Whether the cameras have the same or different resolutions, the pixels of the visible light camera may be registered to the pixels of the depth camera. In this way, both color and depth information may be determined for each portion of an observed scene by considering the registered pixels from the visible light camera and the depth camera (e.g., V-LPixel[v,h] and DPixel[v,h]).

As illustrated at 316, beacon 304 may be configured such that observation data clearly differentiates the beacon from surrounding objects imaged by the depth camera. For example, beacon 304 may include an infrared light beacon that is represented by observation data as a series of white or bright pixels arranged in accordance with a beacon pattern emitted by control device 104.

One or more microphones may determine directional and/or nondirectional sounds coming from an observed human subject and/or other sources. FIG. 3 schematically shows audio data 314 recorded by a microphone, such as microphone 118 of control device 109. Such audio data may be determined in any suitable manner without departing from the scope of this disclosure. Sound source localization could also be used to find and/or locate a controlling device if instead of a light beacon, the control device produces a sound beacon.

The collected data may take the form of virtually any suitable data structure(s), including but not limited to one or more matrices that include a three-dimensional x/y/z coordinate for every pixel imaged by the depth camera, red/green/blue color values for every pixel imaged by the visible light camera, and/or time resolved digital audio data. While FIG. 3 depicts a single frame, it is to be understood that a human subject may be continuously observed and modeled (e.g., at 30 frames per second). Accordingly, data may be collected for each such observed frame. The collected data may be made available via one or more Application Programming Interfaces (APIs) and/or further analyzed as described below.

A depth camera and/or cooperating computing system optionally may analyze the depth map to distinguish human subjects and/or other targets that are to be tracked from non-target elements in the observed depth map. Each pixel of the depth map may be assigned a player index 318 that identifies that pixel as imaging a particular target or non-target element. As an example, pixels corresponding to a first player can be assigned a player index equal to one, pixels corresponding to a second player can be assigned a player index equal to two, and pixels that do not correspond to a target player can be assigned a player index equal to zero. Such player indices may be determined, assigned, and saved in any suitable manner without departing from the scope of this disclosure.

A depth camera and/or cooperating computing system optionally may further analyze the pixels of the depth map of a human subject in order to determine a part of that subject's body that each such pixel is likely to image. A variety of different body-part assignment techniques may be used to assess which part of a human subject's body a particular pixel is likely to image. Each pixel of the depth map with an appropriate player index may be assigned a body part index 320. The body part index may include a discrete identifier, confidence value, and/or body part probability distribution indicating the body part, or parts, to which that pixel is likely to image. Body part indices may be determined, assigned, and saved in any suitable manner without departing from the scope of this disclosure.

As one nonlimiting example, machine-learning can be used to assign each pixel a body part index and/or body part probability distribution. The machine-learning approach analyzes a human subject using information learned from a prior-trained collection of known poses. In other words, during a supervised training phase, a variety of different people are observed in a variety of different poses, and human trainers provide ground truth annotations labeling different machine-learning classifiers in the observed data. The observed data and annotations are used to generate one or more machine-learning algorithms that map inputs (e.g., observation data from a tracking device) to desired outputs (e.g., body part indices for relevant pixels).

At 322, FIG. 3 shows a schematic representation of a virtual skeleton 324 that serves as a machine-readable representation of game player 112. Virtual skeleton 324 includes twenty virtual joints—head, shoulder center, spine, hip center, right shoulder, right elbow, right wrist, right hand, left shoulder, left elbow, left wrist, left hand, right hip, right knee, right ankle, right foot, left hip, left knee, left ankle, and left foot. This twenty joint virtual skeleton is provided as a nonlimiting example. Virtual skeletons in accordance with the present disclosure may have virtually any number of joints.

The various skeletal joints may correspond to actual joints of a human subject, centroids of the human subject's body parts, terminal ends of a human subject's extremities, and/or points without a direct anatomical link to the human subject. Each joint may have at least three degrees of freedom (e.g., world space x, y, z). As such, each joint of the virtual skeleton may be defined with a three-dimensional position. For example, a left shoulder virtual joint 326 is defined with an x coordinate position 328, a y coordinate position 330, and a z coordinate position 332. The position of the joints may be defined relative to any suitable origin. As one example, a depth camera may serve as the origin, and all joint positions are defined relative to the depth camera. Joints may be defined with a three-dimensional position in any suitable manner without departing from the scope of this disclosure.

A variety of techniques may be used to determine the three-dimensional position of each joint. Skeletal fitting techniques may use depth information, color information, body part information, and/or prior trained anatomical and kinetic information to deduce one or more skeleton(s) that closely model a human subject. As one nonlimiting example, the above described body part indices may be used to find a three-dimensional position of each skeletal joint.

A joint orientation may be used to further define one or more of the virtual joints. Whereas joint positions may describe the position of joints and virtual bones that span between joints, joint orientations may describe the orientation of such joints and virtual bones at their respective positions. As an example, the orientation of a wrist joint may be used to describe if a hand located at a given position is facing up or down.

Joint orientations may be encoded, for example, in one or more normalized, three-dimensional orientation vector(s). The orientation vector(s) may provide the orientation of a joint relative to the tracking device or another reference (e.g., another joint). Furthermore, the orientation vector(s) may be defined in terms of a world space coordinate system or another suitable coordinate system (e.g., the coordinate system of another joint). Joint orientations also may be encoded via other means. As non-limiting examples, quaternions and/or Euler angles may be used to encode joint orientations.

FIG. 3 shows a nonlimiting example in which left shoulder virtual joint 326 is defined with orthonormal orientation vectors 334, 336, and 338. In other embodiments, a single orientation vector may be used to define a joint orientation. The orientation vector(s) may be calculated in any suitable manner without departing from the scope of this disclosure.

Joint positions, orientations, and/or other information may be encoded in any suitable data structure(s). Furthermore, the position, orientation, and/or other parameters associated with any particular joint may be made available via one or more APIs.

As seen in FIG. 3, virtual skeleton 324 may optionally include a plurality of virtual bones (e.g. a left forearm bone 340). The various skeletal bones may extend from one skeletal joint to another and may correspond to actual bones, limbs, or portions of bones and/or limbs of a human subject. The joint orientations discussed herein may be applied to these bones. For example, an elbow orientation may be used to define a forearm orientation.

The virtual skeleton may be used to identify a human subject by comparing the values associated with each element of the skeleton (e.g., relative locations of joints and/or bones) with known virtual skeleton values for a particular human subject. For example, a game player may be imaged and a virtual skeleton associated with that player may be stored in a look up table. Thereafter, when the game player is imaged, a generated virtual skeleton may be compared to the stored virtual skeleton. For example, a computing system may include a user profile for each member of a family, including, in order of decreasing virtual skeleton size, a father, mother, son, and daughter. In order to identify a particular user and select a profile associated with that user, a size of the virtual skeleton of the user may be compared to the size of the virtual skeleton associated with each user profile.

In some embodiments, the virtual skeleton may be used to compare a location of an object and/or a beacon, such as beacon 304 as represented at 342, to a location on a game player. For example, a position of beacon 304, as represented at 342, may be determined based on imaged surfaces near the beacon. Accordingly, an imaged surface representing a controller may be used to determine one or more coordinates associated with a detected beacon. Upon determining the position of the beacon, the proximity of beacon 304 or a known location of control device 104 to a hand or other joint of virtual skeleton 324 may be analyzed to determine whether the control device may be associated with game player 112. The association of the control device with a game player is discussed further with reference to FIG. 2 below.

In some embodiments, association of a control device and/or other object to a subject may be performed based on a suitable parameter of the control device in relation to the subject. For example, a game player may place different control devices in different location of an environment, such as environment 100 of FIG. 1. The game player may then perform a gesture or otherwise indicate a particular control device to which a user input is to be associated. Accordingly, a control device may be associated with a subject based on a gesture of the subject, even if another subject is in closer proximity to the control device. It is to be understood, however, that a virtual skeleton may be used for additional and/or alternative purposes without departing from the scope of this disclosure.

Turning back to FIG. 2, as indicated at 208, identifying the human subject optionally may include facial recognition. For example, facial recognition algorithms may be applied to depth images, raw infrared images, color images, and/or images from a visible light camera. Facial recognition may optionally be used to find the particular identity of an observed subject. For example, one or more characteristics detected via facial recognition (e.g., eye separation, eye-eye-mouth triangle, etc.) may be used to look up a user identity in a look up table or other database. To reduce computing power needed, the parts of images that are analyzed for facial recognition may be limited to an area around a head joint of a virtual skeleton, such as virtual skeleton 324 of FIG. 3.

Additionally, as indicated at 210, identifying the human subject optionally may include voice recognition. For example, audio input may be received from a microphone, such as microphone 118 of FIG. 1, and parameters associated with the audio input, such as frequency, amplitude, breathing patterns, etc, may be analyzed and/or compared to audio parameters associated with a user and stored in a look up table or other database to identify the user.

As indicated at 212, the method may optionally include outputting a message requesting an object to emit a beacon. For example, computing system 102 of FIG. 1 may request via one or more RF frequencies that one or more of the control devices 104, 106, and/or 108 emit a beacon so that the computing system may resolve a respective location of the control device in environment 100. In some embodiments, the computing system may request that the object emits a particular beacon. For example, computing system 102 may output a message to each object individually, requesting a different beacon from each object. Computing system 102 may also output one or more messages to assign time slices to the control devices for emitting the same beacon at different times. Likewise, computing system 102 may output any other suitable sequence of messages that will enable the system to detect and associate the beacon that it observes with an object that it communicates with via another communication medium, such as RF, wired cable, etc. In additional or alternative embodiments, computing system 102 may output and/or broadcast a general message that is not directed to a particular object, requesting an object to emit a beacon.

As indicated at 214, method 200 includes recognizing within one or more of the depth images and/or infrared images a beacon emitted from an object. A beacon may be recognized within depth images and/or infrared images in any suitable manner. For example, as shown in FIG. 3, a beacon from control device 104 may be represented in observational data, as schematically represented at 308, as a bright spot illustrated at 316 that is distinguishable from surrounding elements imaged by a depth camera. One or more parameters of the beacon may be analyzed to identify and distinguish a first object from a plurality of different objects. These parameters may include virtually any suitable parameter of the beacon, including but not limited to an intensity, a pattern, and/or a frequency of the beacon. The parameters may be used to find the first object in a look up table that includes one or more objects and one or more parameters associated with the objects. For example, a first object may emit a beacon with one or more different parameters from a beacon emitted by a second object. Objects may additionally or alternatively emit beacons in a time-slice and/or sequential manner such that a time and/or order parameter may be different between objects.

As indicated at 216, method 200 includes assessing a position of the object in three dimensions. For example, computing system 102 of FIG. 1 may utilize pixel information from depth camera 110 to determine x, y, and z coordinates of a control device within environment 100. The position of the object may be relative to the position of depth camera 110, a subject imaged by depth camera 110, and/or any other suitable reference point. In some embodiments, assessing the position of the control device may include identifying the control device within the depth images and/or infrared images and/or may include tracking the position of the control device over time to identify and interpret control gestures from the tracked position. In some embodiments, multiple beacons may be emitted at known locations on the controlling device in order to enable the computing system to compute the controller location. Additional beacons also add redundancy in case some of the beacons are occluded from the sensor. For example, fingers of a game player may be inadvertently placed over a location of a beacon emitter on the control device.

Method 200 further includes associating the object with the human subject based on a proximity of the control device to the human subject, as indicated at 218. For example, control device 104 of FIG. 1 may be associated with game player 112 due to the proximity of the control device to the game player. Conversely, control device 106 may not be associated with either game player 112 or 114, due to the distance between control device 106 and game players 112 and 114. Instead, control device 106 may be associated with table 116. The proximity may be determined based on any suitable threshold.

In some embodiments, the location of the object may be compared to a location of a particular part of a human subject. As indicated at 220, associating the object with the human subject may optionally include analysis of a hand joint position. Turning briefly to FIG. 3, a distance between a hand joint of virtual skeleton 324 and a location of control device 104 (e.g., as determined at 216 of method 200) may be compared to a distance threshold. Although the hand joint is analyzed at 220, it is understood that virtually any suitable joint and/or portion of the virtual skeleton may be analyzed to determine a distance between the joint and/or portion and an object. In some embodiments, an object may be associated with a human subject if the object is within a threshold distance of a hand joint of the human subject. However, the object may not be associated with the human subject if the distance between the object and the hand joint exceeds the threshold distance. In some embodiments without full skeletal tracking, a location of the head/face and the controller along the horizontal axis can be used to determine association.

Associating the control device with the human subject may include attributing a gesture input of the virtual skeleton and a control input of the control device to a same source. For example, such association may allow a user to provide multiple types of user input utilizing multiple devices to control a single player representation. Associating the object with the human subject may additionally or alternatively include applying a settings profile of the human subject to an application controlled by the object. In some embodiments, associating the object with the human subject may include presenting user-specific content via an application controlled by the object. For example, the user-specific content may be game content, media programming, advertising, and/or any suitable content selected for the human subject. In some embodiments, association of the object may change when passing an object from one user to another. For example, association may dynamically update based on a user that is nearest and/or providing input to the object.

A control device or other object may also emit a beacon to perform a binding operation with a computing system. For example, computing system 102 of FIG. 1 may not accept input from control device 106 until a binding operation is successfully completed. As used herein, the term “unbound” may refer to an object that is not bound to a particular computing system and/or an object that is not bound to any computing system.

FIG. 4 illustrates a method 400 of performing a binding operation at a computing system. At 402, method 400 optionally includes detecting an object. An object may be detected by a computing system in any suitable manner. For example, computing system 102 of FIG. 1 may detect an object, such as control device 106 in one or more depth images received from depth camera 110. Additionally or alternatively, computing system may detect an object based on a beacon emitted by the object, including but not limited to an optical beacon, an audio beacon, an RF beacon, etc. In some embodiments, the object may be detected at 406, described below.

As indicated at 404, method 400 includes determining if the object is bound to the computing system. For example, the computing system may identify the object, and determine whether the identified object is bound to the computing system. In some embodiments, the computing system may determine whether the object is bound to the computing system based on a beacon emitted by the object, such as the beacon detected at 406. For example, control devices that are not bound to the computing system may emit a beacon and/or may emit a particular beacon to advertise an unbound state. Therefore, if the object emits a beacon and/or emits a particular beacon, the computing system may determine that the object is not bound to the computing system. Conversely, if the object does not emit a beacon, the object may be determined to be bound to the computing system. If the object is bound to the computing system, no binding operation is necessary, therefore, the method returns to the start of method 400.

If the object is determined to be unbound, method 400 proceeds to 406, where the method includes detecting an optical beacon emitted from the object. For example, an object, such as control device 106 of FIG. 1, may automatically emit an optical beacon in response to changing states if the control device is not bound to computing system 102. Changing states may include changing from an inactive state to an active state. For example, changing from an inactive state to an active state may include changing from a power off state to a power on state. In some embodiments, changing from an inactive state to an active state may additionally or alternatively be performed as a result of detecting interaction with the control device by a user. Detecting interaction may include detecting actuation of a control element of the control device, detecting a signal from a sensor of the control device, and/or be performed in in any other suitable manner. For example, control device 106 may change from an inactive state to an active state when a user removes control device 106 from table 116.

The computing system may detect an optical beacon in a similar manner to those described above. For example, the computing system may recognize an optical beacon in one or more images and/or depth images received from a visible light camera and/or a depth camera. In some embodiments, the computing system may include an infrared sensor to detect an infrared light beacon and/or a visible light sensor to detect a visible light beacon that is emitted from the object. For example, a depth camera may include an infrared-pass filter in front of an infrared sensor in order to filter out visible light and allow infrared light having a particular range of wavelengths to be detected by the infrared sensor. Accordingly, an object may emit a beacon with a wavelength that is matched to an infrared-pass filter so that it may be detected by the depth camera.

As indicated at 408, method 400 includes binding the object to the computing system. For example, the computing system may automatically enter a binding mode upon detection of an optical beacon and/or an object that is unbound in order to bind the object to the computing system.

As indicated at 410, binding the object to the computing system may optionally include performing a handshake operation to verify the identity of the control device. The handshake operation may be performed in virtually any suitable manner. In some embodiments, performing a handshake operation may optionally include sending a request for a particular beacon pattern, as indicated at 412. For example, computing system 102 of FIG. 1 may send a request to control device 106 to emit a particular beacon pattern. The request may identify any suitable pattern, including but not limited to number, duration, and/or frequency of optical signals to be emitted from the control device. A control device may also include a plurality of light sources that may be independently controlled to emit a particular beacon. Accordingly, the request may identify a particular beacon pattern for one or more light sources of the plurality of light sources, such that a first light source may be requested to emit a different pattern than a second light source. If the requested beacon pattern and/or patterns are detected by the computing system, such as through analysis of depth images, the computing system may identify and/or determine the position of the control device.

Performing a handshake operation may additionally or alternatively include sending a request for an identifier associated with the object, as indicated at 414. For example, computing system 102 of FIG. 1 may send a request to control device 106 for an identifier associated with control device 106. Upon receiving an identifier for control device 106, computing system 102 may authenticate the control device and may bind the control device to the computing system. In some embodiments, computing system 102 may assign a unique identifier to the object to bind the object upon completing a handshake operation. Binding an object to a computing system may be performed in any suitable manner, including but not limited to storing an identifier for the object in a storage machine and/or database.

FIG. 5 illustrates example communications 500 between computing system 102 of FIG. 1 and two control devices during a binding operation. The communications may optionally begin with computing system 102 sending a request for control device A to emit a beacon at 502. In response or independently, control device A may emit a beacon at 504. For example, the computing system 102 may not be able to send a request to a control device when the control device is in an unbound state. Accordingly, control device A may emit a beacon at 504 when the control device is in an unbound state without receiving a request from the computing system. The beacon emitted at 504 may be a particular beacon pattern associated with a binding request and/or binding mode of the control device. Likewise, control device B may emit a beacon at 506. In response to receiving the beacon from either control device A or control device B, computing system 102 may enter a binding mode at 507 in order to bind control device A and/or control device B.

In some embodiments, binding may occur before the computing system provides any commands or requests to the control device. For example, computing system 102 may automatically bind with control device A upon receipt of the beacon emitted at 504. In additional or alternative embodiments, computing system 102 may initiate a handshake operation by requesting a particular beacon pattern from control device A at 508. For example, computing system 102 may determine or negotiate an RF channel over which control device A may communicate. In some embodiments, the beacon emitted at 504 and/or 506 may be a different type beacon than the beacon requested at 508. For example, the beacon emitted at 504 may be an infrared beacon, while the beacon requested at 508 may be an RF signal. In some embodiments, the RF channel may be specific to control device A. Computing system 102 may then specifically address control device A by sending a request for the particular beacon pattern over the channel. In response, control device A emits the particular beacon pattern at 510. Upon confirming that the beacon pattern emitted by control device A is the same and/or within a threshold of the requested particular beacon pattern, computing system 102 may bind control device A and/or confirm or finalize a previously-executed binding operation. For example, as indicated at 512, computing system 102 may assign and send a first unique ID to control device A and/or store the unique ID in order to bind control device A. Upon binding, control device A and computing system 102 may freely communicate with one another in any suitable manner, such as utilizing a particular RF channel. In other words, computing system 102 may allow control device A to issue commands once the binding operation has been performed.

As indicated at 514, computing system 102 may also request that control device B emit a particular beacon pattern. In some embodiments, the particular pattern may be the same as the particular beacon pattern requested at 508, while in other embodiments, the particular pattern may be different than the particular beacon pattern requested at 508. For example, if the particular pattern is the same, the computing system may associate a time value to the pattern to differentiate a pattern received from control device A from a pattern received from control device B. Conversely, if the particular pattern is different for each control device, the computing system may be able to differentiate two beacons received at the same time. After the request sent at 514, control device B may emit a beacon having an unrequested pattern at 516. For example, while the request sent at 514 may be intended for control device B, control device B may not receive the request. Therefore, control device B may emit a beacon having a pattern that is different than and/or outside of a similarity threshold of the pattern requested at 514. In response, computing system 102 may determine that reliable communication may not be established with control device B, and thus may not bind control device B. In some embodiments, computing system 102 may attempt to bind with control device B again by sending another request for a particular beacon pattern. In additional or alternative embodiments, computing system 102 may automatically bind with control device B after receiving a number of beacons from control device B that exceeds a threshold. In some embodiments, the threshold number of beacons may be one, such that computing system 102 binds a control device or other object immediately upon detecting a beacon emitted by the control device or other object.

In some embodiments, the methods and processes described herein may be tied to a computing system of one or more computing devices. In particular, such methods and processes may be implemented as a computer-application program or service, an application-programming interface (API), a library, and/or other computer-program product.

FIG. 6 schematically shows a non-limiting embodiment of a computing system 600 that can enact one or more of the methods and processes described above. Computing system 600 is shown in simplified form. Computing system 600 may take the form of one or more gaming consoles, personal computers, server computers, tablet computers, home-entertainment computers, network computing devices, mobile computing devices, mobile communication devices (e.g., smart phone), and/or other computing devices. For example, computing system 600 may include computing system 102 of FIG. 1.

Computing system 600 includes a logic machine 602 and a storage machine 604. Computing system 600 may optionally include a display subsystem 606, input subsystem 608, communication subsystem 610, and/or other components not shown in FIG. 6.

Logic machine 602 includes one or more physical devices configured to execute instructions. For example, the logic machine may be configured to execute instructions that are part of one or more applications, services, programs, routines, libraries, objects, components, data structures, or other logical constructs. Such instructions may be implemented to perform a task, implement a data type, transform the state of one or more components, achieve a technical effect, or otherwise arrive at a desired result.

The logic machine may include one or more processors configured to execute software instructions. Additionally or alternatively, the logic machine may include one or more hardware or firmware logic machines configured to execute hardware or firmware instructions. Processors of the logic machine may be single-core or multi-core, and the instructions executed thereon may be configured for sequential, parallel, and/or distributed processing. Individual components of the logic machine optionally may be distributed among two or more separate devices, which may be remotely located and/or configured for coordinated processing. Aspects of the logic machine may be virtualized and executed by remotely accessible, networked computing devices configured in a cloud-computing configuration.

Storage machine 604 includes one or more physical devices configured to hold instructions executable by the logic machine to implement the methods and processes described herein. For example, logic machine 602 may be in operative communication with a depth camera interface, such as an interface of depth camera 110, and storage machine 604. When such methods and processes are implemented, the state of storage machine 604 may be transformed—e.g., to hold different data.

Storage machine 604 may include removable and/or built-in devices. Storage machine 604 may include optical memory (e.g., CD, DVD, HD-DVD, Blu-Ray Disc, etc.), semiconductor memory (e.g., RAM, EPROM, EEPROM, etc.), and/or magnetic memory (e.g., hard-disk drive, floppy-disk drive, tape drive, MRAM, etc.), among others. Storage machine 604 may include volatile, nonvolatile, dynamic, static, read/write, read-only, random-access, sequential-access, location-addressable, file-addressable, and/or content-addressable devices.

It will be appreciated that storage machine 604 includes one or more physical devices. However, aspects of the instructions described herein alternatively may be propagated by a communication medium (e.g., an electromagnetic signal, an optical signal, etc.) that is not held by a physical device for a finite duration.

Aspects of logic machine 602 and storage machine 604 may be integrated together into one or more hardware-logic components. Such hardware-logic components may include field-programmable gate arrays (FPGAs), program- and application-specific integrated circuits (PASIC/ASICs), program- and application-specific standard products (PSSP/ASSPs), system-on-a-chip (SOC), and complex programmable logic devices (CPLDs), for example.

When included, display subsystem 606 may be used to present a visual representation of data held by storage machine 604. This visual representation may take the form of a graphical user interface (GUI). As the herein described methods and processes change the data held by the storage machine, and thus transform the state of the storage machine, the state of display subsystem 606 may likewise be transformed to visually represent changes in the underlying data. Display subsystem 606 may include one or more display devices utilizing virtually any type of technology. Such display devices may be combined with logic machine 602 and/or storage machine 604 in a shared enclosure, or such display devices may be peripheral display devices. For example, display subsystem 606 may include display device 124 of FIG. 1.

When included, input subsystem 608 may comprise or interface with one or more user-input devices such as a keyboard, mouse, touch screen, microphone, or game controller. For example, input subsystem may include or interface with control devices 104, 106, 108, and/or 109 of FIG. 1. In some embodiments, the input subsystem may comprise or interface with selected natural user input (NUI) componentry. Such componentry may be integrated or peripheral, and the transduction and/or processing of input actions may be handled on- or off-board. Example NUI componentry may include a microphone for speech and/or voice recognition; an infrared, color, stereoscopic, and/or depth camera for machine vision and/or gesture recognition; a head tracker, eye tracker, accelerometer, and/or gyroscope for motion detection and/or intent recognition; as well as electric-field sensing componentry for assessing brain activity.

When included, communication subsystem 610 may be configured to communicatively couple computing system 600 with one or more other computing devices. Communication subsystem 610 may include wired and/or wireless communication devices compatible with one or more different communication protocols. As non-limiting examples, the communication subsystem may be configured for communication via a wireless telephone network, or a wired or wireless local- or wide-area network. In some embodiments, the communication subsystem may allow computing system 600 to send and/or receive messages to and/or from other devices via a network such as the Internet.

It will be understood that the configurations and/or approaches described herein are exemplary in nature, and that these specific embodiments or examples are not to be considered in a limiting sense, because numerous variations are possible. The specific routines or methods described herein may represent one or more of any number of processing strategies. As such, various acts illustrated and/or described may be performed in the sequence illustrated and/or described, in other sequences, in parallel, or omitted. Likewise, the order of the above-described processes may be changed.

The subject matter of the present disclosure includes all novel and nonobvious combinations and subcombinations of the various processes, systems and configurations, and other features, functions, acts, and/or properties disclosed herein, as well as any and all equivalents thereof. 

The invention claimed is:
 1. A method, comprising: receiving one or more depth images from a depth camera, the depth images indicating a depth of a surface imaged by each pixel of the depth images; modeling a human subject with a virtual skeleton including a hand joint and a head joint; performing facial recognition constrained to an area around the head joint of the virtual skeleton to determine an identity of the human subject modeled by the virtual skeleton; recognizing within one or more of the depth images a beacon emitted from a control device; finding the hand joint nearest to the emitted beacon in three dimensions; and associating the control device that emitted the beacon with the identity of the human subject modeled by the virtual skeleton that includes the hand joint that is nearest to the emitted beacon.
 2. The method of claim 1, wherein finding the hand joint nearest to the emitted beacon includes identifying the control device within the depth images.
 3. The method of claim 1, wherein the depth images are received from a time-of-flight depth camera, and wherein associating the control device with the human subject based on a proximity of the control device to the human subject comprises associating the control device with the human subject if the human subject is within a threshold distance from the control device and not associating the control device with the human subject if the human subject is outside of the threshold distance from the control device.
 4. The method of claim 1, wherein performing facial recognition includes performing facial recognition to an area of an image captured by a visible light camera, the area corresponding to a location of the head joint.
 5. The method of claim 1, wherein associating the control device with the human subject includes attributing a gesture input of the virtual skeleton and a control input of the control device to a same source.
 6. The method of claim 5, wherein the control device is associated with the human subject if a position of the control device is within a threshold distance of the hand joint of the virtual skeleton.
 7. The method of claim 1, further comprising outputting to the control device a message requesting the control device to emit the beacon.
 8. The method of claim 1, wherein the beacon is an infrared light signal.
 9. The method of claim 1, wherein the control device includes a game controller and wherein the human subject is a first human subject, the method further comprising assessing a location of a second human subject, comparing a proximity of the second human subject to the control device relative to the proximity of the first human subject to the control device, and associating the control device to the second human subject based on the relative proximity of the second human subject to the control device.
 10. The method of claim 1, wherein the control device includes an audio headphone.
 11. The method of claim 10, further comprising adjusting an audio output of an application providing sound to the audio headphone based on a position of the audio headphone.
 12. The method of claim 1, wherein associating the control device with the human subject includes applying a settings profile of the human subject to an application controlled by the control device.
 13. The method of claim 1, wherein the control device includes remote control.
 14. The method of claim 1, wherein associating the control device with the human subject includes presenting user-specific content via an application controlled by the control device, the user-specific content selected for the human subject.
 15. The method of claim 1, further comprising tracking a position of the control device over time and interpreting control gestures from the tracked position.
 16. A method, comprising: recognizing a beacon output from a first object; finding a hand joint nearest to the beacon; modeling a first subject with a virtual skeleton including the hand joint and a head joint, the head joint and hand joint being connected to one another; performing facial recognition constrained to an area around the head joint of the virtual skeleton to determine an identity of the first subject modeled by the virtual skeleton; and associating the first object that output the beacon with the identity of the first subject modeled by the virtual skeleton including the hand joint that is nearest to the beacon.
 17. The method of claim 16, further comprising analyzing a parameter of the beacon to identify the first object of a plurality of different objects.
 18. The method of claim 16, further comprising: recognizing a beacon from a second object; assessing a position of the second object in three dimensions; and associating the second object with a second subject based on a proximity of the second object to the second subject.
 19. A computer, comprising: a depth camera interface for receiving depth images from a depth camera, the depth images indicating a depth of a surface imaged by each pixel of the depth images; a logic machine in operative communication with the depth camera interface; and a storage machine in operative communication with the logic machine, the storage machine storing instructions executable by the logic machine to: receive one or more depth images from the depth camera; model a human subject with a virtual skeleton including a hand joint and a head joint; perform facial recognition constrained to an area around the head joint of the virtual skeleton to determine an identity of the human subject modeled by the virtual skeleton; output to a control device a message requesting the control device to emit a beacon; recognize within one or more of the depth images the beacon emitted from the control device; find the hand joint nearest to the emitted beacon in three dimensions; and associate the control device that emitted the beacon with the identity of the human subject modeled by the virtual skeleton that includes the hand joint that is nearest to the emitted beacon. 